The Bay Area is one of the wealthiest regions in the world. Attracting burgeoning tech startups that become successful, as well as the wealth that comes with it, the Bay Area is also home to four of the top ten most expensive zip codes in the United States according to CNBC: Atherton, Palo Alto, Los Altos, and Ross, with Atherton coming out as the most expensive zip code for the third year in a row, and the wealthiest zip code in the world. However, less than six miles away from Atherton is East Palo Alto and North Fair Oaks, which, along with Oakland and Richmond, are some of the less affluent cities in the Bay Area. Not only is there stark income inequality in the Bay Area, there is also edcuational inequality. According to Niche.com, Palo Alto and Los Altos have the second and third best school districts, while East Palo Alto has the worst school district. The goal of my project is to look at why such stark inequality, both income and educational, exists in the Bay Area, especially between cities so close together.
Looking at both maps, it is evident that there is stark income inequality and educational inequality in the Bay Area, even between zip codes that are right next to each other. It is also very clear that, from looking at the plot, there is a clear correlation between educational attainment and median household income. However, one reason this might occur is because families who don’t speak English fluently have a harder time finding a job and making money, which leads to a lower educational attainment for their children, and continues the cycle of poverty. The Bay Area is a very diverse place, and areas like Atherton have a very high percentage of fluent English speakers, while areas like East Palo Alto have a high percentage of immigrants, whose first language is not necessarily English.
##
## Call:
## lm(formula = perc_college ~ perc_over100k + perc_fluent_english,
## data = bay_multiple_tract)
##
## Residuals:
## Min 1Q Median 3Q Max
## -0.25494 -0.04300 0.00166 0.04213 0.37903
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 0.091801 0.007845 11.70 <2e-16 ***
## perc_over100k 0.327934 0.011989 27.35 <2e-16 ***
## perc_fluent_english 0.631483 0.012488 50.57 <2e-16 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 0.07004 on 1574 degrees of freedom
## Multiple R-squared: 0.8222, Adjusted R-squared: 0.822
## F-statistic: 3640 on 2 and 1574 DF, p-value: < 2.2e-16
Looking at the 3D plot, we can see that there is a clear correlation between educational attainment, income, and language spoken at home. The two plots reinforce this correlation, with the best-fit lines showing an even clearer picture of the strong correlations between English fluency, educational attainment, income. Both show a very strong correlation, especially the plot comparing English fluency and educational attainment, where the best-fit line matches the trend of the data very closely. Furthermore, the model shows that the variables and model are significant; there is a high r-squared value, so 82% of variance is explained. There is also a very low p-value (p<2.2e-16) and a high F-static, meaning that the variable and model are significant, and that the correlation is strong.
After using Mahalanobis matching, and treating Atherton (94027) and North Fair Oaks (94063) based on Median Household Income and Educational Attainment, San Francisco (94127) and Coloma (94014) emerged as similarly matching zip codes that were also similar in proximity (less than 6 miles apart), as shown by the two maps. The above plot on the left illustrates the stark difference in both income and educational attainment between the four zip codes, with Atherton and San Francisco being the more affluent and educated, and North Fair Oaks and Coloma being the less affluent and educated, despite their close proximity to Atherton and San Francisco. The plot on the right demonstrates how primary language plays a role, as Atherton and San Francisco, being the wealthier and more educated cities, both have low percentages of non-English speakers, while North Fair Oaks and Coloma, being the less affluent and less educated cities, have higher percentages of non-English speakers (greater than 60%).
All in all, the Bay Area is home to a lot of wealth and extremely educated people, but also a lot of income and educational inequality. Through multiple variable regression, as well as 3D plotting, a correlation was established, between one’s primary language, and income and educational inequality, where it is hypothesized that when one’s primary language is not English, it make school harder, which makes graduating, achieving higher education, and finding a job much more difficult, which in turn makes earning income a bigger challenge. Through Mahalanobis matching, San Francisco and Coloma were found as similar zip codes to Atherton and North Fair Oaks in terms of educational attainment and income. A case study of comparing the two reinforced the hypothesis that one’s primary language has affects income and educational attainment, as the wealthier cities (San Francisco, Atherton) had lower percentages of people who didn’t speak English well, while the less affluent cities (Coloma, North Fair Oaks) had higher percentages of non-English speakers. In order to combat both income inequality and educational inequality, and make the Bay Area a more just place and a place of equal opportunity, education and jobs must be accessible for people of all backgrounds, including those who’s first language is not English. Furthermore, there must be accommodations made for non-English speaking students, in order to assist them in learning to their potential, instead of being restricted by a language barrier. Part of the beauty and success of the Bay Area comes from it’s diversity, but we can’t allow non-English speaking members of the Bay Area to continue facing income and educational inequality, simply because English was not their first language.